Remove config assumption in Trainer by sgugger · Pull Request #7464 · huggingface/transformers

sgugger · 2020-09-29T23:25:18Z

What does this PR do?

This PR tries to limit the access to model.config in Trainer to the minimum so that it works with regular PyTorch modules (as long as they accept dict inputs and return loss first like our models). The most challenging part was the storing/restoring of the total_flos, which I moved to the newly created TrainerState. It should work as before and be saved along the rest of the training state.

LysandreJik

LGTM!

LysandreJik · 2020-09-30T07:48:09Z

src/transformers/trainer.py

-        assert not getattr(
-            self.model.config, "output_hidden_states", False
-        ), "The prediction loop does not work with `output_hidden_states=True`."
-


Wouldn’t we want to put these lines inside an if statement? The prediction loop still doesn’t work with these outputs right?

Nope it does now since the functions that detach/concat etc. all work on nested list/tuples of tensors :-)

TevenLeScao

LGTM ! Much cleaner this way, thanks!

sgugger added 2 commits September 29, 2020 19:21

Remove config assumption in Trainer

01b0185

Initialize for eval

4b0161f

sgugger requested review from LysandreJik, TevenLeScao and julien-c September 29, 2020 23:25

LysandreJik approved these changes Sep 30, 2020

View reviewed changes

sgugger mentioned this pull request Sep 30, 2020

Distributed Trainer: 2 little fixes #7461

Merged

TevenLeScao approved these changes Sep 30, 2020

View reviewed changes

sgugger merged commit fdccf82 into master Sep 30, 2020

sgugger deleted the trainer_dont_assume_config branch September 30, 2020 13:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove config assumption in Trainer#7464

Remove config assumption in Trainer#7464
sgugger merged 2 commits intomasterfrom
trainer_dont_assume_config

sgugger commented Sep 29, 2020

Uh oh!

LysandreJik left a comment

Uh oh!

LysandreJik Sep 30, 2020

Uh oh!

sgugger Sep 30, 2020

Uh oh!

TevenLeScao left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

sgugger commented Sep 29, 2020

What does this PR do?

Uh oh!

LysandreJik left a comment

Choose a reason for hiding this comment

Uh oh!

LysandreJik Sep 30, 2020

Choose a reason for hiding this comment

Uh oh!

sgugger Sep 30, 2020

Choose a reason for hiding this comment

Uh oh!

TevenLeScao left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants